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SEQUENTIAL RANKING PROCEDURES 
By 
Klias Alphonse Parent, Jr. 


1. Introduction. Many statistical procedures used and studied 
today are sequential in nature. By this we mean that the time when a 
statistical decision is reached is random. In contrast to such proce- 
dures are the fixed sample size procedures. Best known perhaps is 
sequential analysis and the sequential probability ratio test as formu- 
lated by Wald [6]. There are other sequential procedures, for example 
in process inspection schemes, where, based on a sequence of observations 
a decision is made to stop the process and take some adjusting action, 
the time at which the process is stopped being a random variable. There 
are many other sequential-like procedures. 

In the theory of hypothesis testing for the case of a simple hypo- 
thesis against a simple alternative it is known that a most powerful test 


can be determined by the Neyman-Pearson lemma, which is of the form: 


. nn ho £, (% Koy seep x) _ 
me7jece = Es ee x, Xs ee x 
where the hypotheses to be tested are f = fo against f = fs Lo and 


f are the joint densities of the observations X X 


1 ye Xora ttt x corre- 


sponding to each hypothesis. This is an example of a nonsequential 
procedure. To extend such a procedure to the sequential idea we need 
only modify the test as follows: 

take a sample of size of size m and 


K 


reject f af A 7 


O 


accept fo iat A = K, 


IV 


“ 


draw another sample of size n-m if K, — A < Ky 


1 





if the second sample is required compute A and 


reject f ae AL aK 


O 
accept fi alte A Ss Kee 


Such a simple modification gives us a two stage procedure with a new 
feature in that the total sample size is random, being either m or n, 
depending upon the outcome of the first stage. This basic idea of a 
sequential test was proposed by Dodge and Romig in [8], and has been 
extended to multiple stage sampling plans. 

Sequential hypothesis testing as proposed by Wald requires that 
@ computation of AY and a decision be made as each observation is 
taken. Briefly, to test f = Py against f = rs select constants 


B< A and compute A as each observation is taken, and proceed 


according to the rule 


jak A 2 A reject f = ty 
if A, Slew dehiele 3 = t) 
i bo AL <A take another observation and compute A+] 


Since the sequential probability ratio test is formulated in 
terms of the ratio which leads to most powerful tests according to 
the Neyman-Pearson theory we would expect it to have good properties. 
This indeed is the case in that of all tests with the same power the 
sequential probability ratio test requires on the average fewest obser- 
vations. This optimal property was conjectured by Wald and finally 


proved by Wald and Wolfowitz in [9]. 





In order to carry out these sequential tests of hypotheses we note 
that an assumption as to the specific form of Po and tr must be 
made. It often happens that the form of the underlying distribution 
is not assumed known and in this case nonparametric statistical methods 


are used. In nonparametric statistics many tests of statistical hypo- 


At 


theses are based on the set of ranks {T >? 


; TJ determined 


from a random sample {X,, X mises. XJ; or the signs of the obser- 


i ee 

vations ( l according as A in positive or negative) or on a 
combination of both of these sets of statistics derived from the basic 
observations. The sign test, signed rank test, Wilcoxon-Mann-Whitney 
test, Fisher-Yates test and many others are examples of such fixed 
sample size nonparametric tests. 

Contrary to the case in parametric statistics (as opposed to non- 
parametric statistics) there are very few sequential procedures in 
nonparametric statistics, particularly sequential procedures based on 
Signs, ranks, or both. One reason for this is that for most specified 
alternatives to the null hypothesis it is difficult to compute proba- 
bilities for statistics based on signs and ranks which in turn makes 
it difficult to preperly evaluate the properties and operating charac- 
teristics of the procedures. This difficulty can be circumvented by 
restricting attenticn to special classes of alternatives such as those 
proposed by Lehmann in [1], where to the null hypothesis F(x) he 
proposed alternatives of the form F(x), a> QO. This of course does 
not solve the basic problem of alternatives as the question of whether 


or not the Lehmann alternative is appropriate for the problem being 


considered arises. However it is a first step inasmuch as it does 


y 





allow us to develop some sequential procedures where exact distribution 
theory calculations are possible. In the fixed sample size problem it 
simplifies considerations of power of rank tests. 

An example of a nonparametric sequential test is the following 
adaptation of Wald's sequential probability ratio test for binomial 
observations. Consider a sequence of independent identically distrib- 


uted random variables X x with cumulative distribution function 


ee ee 


F(t) = P(X, <t). We wish to test F( ty ) = Po against F(t) ) =P} 
for some fixed value Cys The number of observations less than or 
equal Co to» say N, after taking n observations, is a binomial 


random variable with parameters F(t) and n. The probability ratio 


reduces to 


mar r P(N| Mee) = = (7 5 ) ¢ (1 = 


n P(N] F(t.) =p.) i ap oy 


and the sequential test based on this ratio is discussed in Wald [6]. 
For the special case where to = QO, N is equivalent to the number of 
negative observations after n trials and this would be a sequential 
test based on the signs of the observations. 

An example of a nonparametric sequential procedure based on ranks 
of observations is the grouped rank test developed by Wilcoxon, Rhodes 
and Pradley [4]. Actually two sequential procedures are developed in 
(4], the Configural Rank Test and the Rank Sum Test. Basically, obser- 
vations are taken in groups of m X's and n Y's and the observations 


are ranked within each group. For each group a statistic is computed 








based on the ranks and Wald's sequential probability ratio test is 
applied to the sequence of statistics so generated. Each group of m 

X's and n Y's becomes the basic unit used in the probability ratio. 
Suppose the X- population has distribution F(x) and the Y- population 


has distribution G(y), and observations are taken as follows 


Or ee Se ey oo: ae - group l 

OE Koos ces 9 Xone Yay: Yoo. oe Yo) - group 2 

X ae ee - 
Let Ry = (Ry Ryos tet ae Sie So ees, Syn) be the rank 
vector associated with group ¥ where R_.. is the rank of X and 


Vaal Y1 


A isecne rank of Ty 49 the ranks taken from the combined ranking of 
mies x's and Y's. Takine 4 function or Ry; say T, = T(R,), we 


Ap --- and the Wald 


generate a new sequence of random variables Ty» >? 


sequential probability ratio test may now be applied to the I - For 


independent group to group sampling we have 


n P(T, | Y ~ G(y)) 
(1.2) A 


n lI P(T, | aad F(y)) 

y=L 
as the probability ratio to test the hypothesis that the Y- population 
has distribution F(y) against G(y). In [4] the authors consider 
Lehmann alternatives G(y) = F“(y), k > 0 ‘and the function Tea 


one case is the actual configuration of X's and Y's, which is 








equivalent to the vector (S S ... , S,_), and in the second 


Ti ae Yn 
case T is taken to be the sum of the Y ranks. 

Wilcoxon, Rhodes and Bradley observe that the test could be 
improved by taking observations in pairs and reranking from the begin- 
ning each time a new observation pair is taken. One reason for the 
reduced efficiency of the group ranking method is that the observations 
in one group are not compared with observations from any other group. 
The reranking suggestion would take into account all comparisons. How- 
ever, this is very cumbersome, and moreover reranking introduces non- 
independence of successive probability ratios making an analysis of the 
properties of such a procedure difficult. 

Thus in order to attack the problem of nonparametric sequential 
tests of hypotheses based on ranks we should consider procedures such 
that the distribution theory is tractable and such that ranks are 
assigned in a truly sequential manner, avoiding as much as possible 
the complexities introduced by reranking. To this end two new sequen- 
tial ranking methods will be defined in this dissertation. 

In order to be led somewhat naturally to these new ranking methods 
we now consider the reranking procedure in more detail. Let a be 


the rank of oe at the ie stage in the reranking process. We 


observe X x 4 and each time a new observation is 


1? a7 ash : n’ 


taken the entire set of observations is reranked. We have 





Observation vectors Rank vectors 


(X, ) (a) 
(X,, Kop cee y x) ae Tor tee i) 


Ee 


Notice that the vector (T ne 


WwW Ta completely deter- 
mines the n rank vectors listed above in the sense that each vector 
could be reconstructed given only I. T= ie, 2s ee T.5 is 


X 


the rank of x. relative to the set {X >? ; XJ. Thus we 


dee 
can rank an observation as it is observed, relative to the preceeding 
observations without reranking the previous observations and still 
retain the information contained in the n rank vectors which would 
come from reranking. This method of ranking observations is one way 
of assigning ranks which fits in naturally with the idea of sequential 
procedures and lends itself to developing sequential procedures in non- 
parametric problems. This ranking procedure also takes into account 
all comparisons among the observations. 

Analogous to the fixed sample size signed rank test we will define 
a second sequential ranking procedure based upon the absolute values of 
the observations and taking into account the signs of the observations. 
This signed sequential ranking procedure will be applied to a problem 


in process control. By process control we mean a procedure where the 


aim is to determine when a given sequence of random variables changes 








from being distributed according to a distribution F(x) to a different 
distribution G(x). The term process control enjoys a broader definition 
today including those cases where the process is adjusted according to 
some statistic based upon the sequence of observations. Such proce- 
dures are referred to as adaptive control methods. 

The early methods used to control a process were based on control 
charts (Shewhart charts) and modifications of these control charts. 
To control the mean value of some dimension of a process at a particular 
value oe samples of size n are taken at frequent intervals of time 
and the sample mean X is compared with a ko/vn . If X falls 
outside these lines the process is stopped and adjustments to the 
Deecess are Carried OuL, and for at ko/vyn < X < Os as ko/7n the 
process is allowed to continue without adjustment. Modifications to 
the basic control chart method came in the form of "warning lines” 
imisade the action lines anes ko/vyn . Further modifications were 
introduced which changed the action rule to rules of the type "If K 
consecutive points on the chart fall outside control lines, take action.” 
These early procedures failed to take advantage of all the information 


X 


eOntained in the sequence x protest 


e x At best the modified 


action rules used only the information contained in a fixed number of 
sample values in the immediate past. 

In order to take advantage of this unused information the stopping 
rule should incorporate the entire sample. A step in this direction 
was taken by Page in [7] with the introduction of cumulative sum 
schemes. If the mean of a process is to be controlled the cumulative 
sums S| = y (x, - k) are plotted on a chart against n. The entire 


i=l 
8 





history of the process is presented and changes in the process mean are 
visible through changes in direction of the mean path. To detect one- 
Sided deviations in the mean, say increases, the stopping rule used is 
to stop the process when the current point of the path Gor s 1) rises 
a given amount h>O above the previous lowest point of the path. 
Iwo-sided deviations are treated by applying two one-sided schemes 
simultaneously. For normal observations the cumulative sum schemes 
have been found to be more sensitive than the Shewhart control chart. 
When no assumption is made as to the form of the underlying dis- 
tributions we might look to non parametric methods for a control 
procedure. For example, the sequential rank of X. is equally likely 
to be 1, 2, ... , i as long as no change takes place in the distri- 


buGion of A Xx 


1? *o» r A. But when a location change takes place, 


say an increase in the process mean, larger ranks would be more probable. 


We will consider the sequential rank of IX, | relative to Pauls XI, 
IX, multiplied by the sign of X,(+1 if X,>0 and -1 if X, <0) 
in a process control problem. This method of sequentially assigning 
ranks, as noted before, will be called signed sequential ranking. 

This dissertation defines two methods of assigning ranks in a 
sequential manner to observations Xi» x, o «  Basier properties of 
the sequential ranks are studied and distribution theory is determined. 
Section 2 contains some preliminary results including some relating to 
order statistics of observations taken from non identical distributions. 
These results are used in the later sections. In Section 3 the method 


of sequential ranking is defined and it is shown that for a fixed sample 


size, ordinary ranks and sequential ranks are equivalent for the purpose 


9 








of hypothesis testing. Section 4 is an application to sequential hypo- 
thesis testing for the two sample problem where the alternative is of 
the form proposed by Lehmann in [1]. The signed sequential ranking 
scheme is defined in Section 5 and a condition on the distribution of 
the sequence of observations is given which implies that the signed 
sequential ranks are independent. Distribution theory is given for the 
signed sequential ranks. Section 6 contains an application of signed 


sequential ranking to a process control problem. 


a. lEreliminary results. Let Ky Xe» ee x be any random 


wMeariables with continuous comulative distribution functions |. 


ae 
stata 


F Fa Define Xk to be the k smallest in the set 
n 


le ERs eS } 


{X ae XJ. We can obtain a general expression for the distri- 


alee Ky 


iter One OL xX K as follows: 
n 


(2.1) F(x) = P(X 


nk nk és x) 


n 
= ) P(i X's are <x and n-i X's are > x) 


Letting E, denote the event [i X's are <x and n-i X's are >x] 
there are (3) ways to select the X's which are less than or equal to 


x, and a typical way in which E. Ccouldeoccir ts 


Bo] (X,. <Xy wef ton eee ee 
J4 Ji4+] Jn 


ier Cree til, 2c eens ay G) to take into account all possible cases. 


LO 





Hor jee the events E,. andy ,are disjoint and H, =n... 
a) ly 1 j 1J 
Thus we have 


(") 


BG) = ae) eae dL P(E, ;) 
7 i=k i=k j=l 


and further, when the x. are assumed to be independent we obtain 


P(E, ») = f] Ps Sx) i Gee P(X, ee) ae 
=] 


e m 
iene le 


As a special case of (2.1), to be used later, we have the following 
result when the X's are distributed according to only two different 


@istri butions. 


? Xo» Soe Ay be independent random variables 
where (X,, 1 <i<m} are distributed according to F(x) and 


X., m+1l<i< WN} are distributed according to G(x). Then 
a” = = 


N i ; 
_ . my, ,N-m Pa m- J 
(2.2) Fy) = 2 PAPE F(x) (1-F(x)) 


679(x) (1-G(x) PP 


Proof: Each of the basic events E, (defined above) can be 





written as a union of disjoint events ae j = Oj 22 ese, © Sieme 
Es consists of j X's (with distribution F(x)) <x and i- j X's 
(with distribution G(x)) <x, the remaining X's are >x. There are 
(|) CD ways to select such an event, each having probability 

Fd (x) (1-F(x))™-9 gt79 (x) (aces nae We use the convention that 
Eo) SG) tie iy 


ial 








i 
Remark: When G= F we can use the fact that ) CMe = (3) 
IEE 
to get the known result 


N 
N i N-i 

(2.3) F(x) = 2G) F(x)G-Fa)) . 

é i 

i=k 

In order to derive the distribution theory associated with the 

sequential ranking procedures proposed in this paper the next lemma 
will be useful. We consider a random variable X with a continuous 
distribution function F(x) and define the sign of X tobe 1 if 
A -O and -l1 if X<0Q. Letting E = sign of A, we can compuce tie 


feiom GiStribution function for “and |x| as 


0 -oe<cy<0, -s7< xX <om 


O -oe< y<w, -wa~ <x <- 1 
(2.4) Mee) = 
F(O) - F(-y) O<y<o, -1< x<1 


© 
\ 


Mapes) Ofc, 1 e = ee 


where F(x,y) = P(E <x, |X| < y), 
since for -»<y <0, -2<x<o, |X| >O with probability 1 implies 
F(x,y) =0, for -»~<y<o, -w»<x<-1, E=+1 with probability 


1 implies F(x,y) =0, for O<y<o~, -1<x<i, F(x,y) 


A 


= P(-y <X <0) = F(O) - F(-y) and for O<y<~, 1<x<», 


EEGs) 


P(-y <X <y) = Fly) - F(-y). 


In developing the properties of the signed sequential rank an 
important role will be played by the dependency of the sign of xX and 
|x| and thus we establish a condition whereby E and |x| are 


independent random variables in 





ee ee 


Lemma 2.2 |x| and sign of X (= E) are independent if and only if 


F(-x) = F(O) [1 - F(x) + F(-x)] for all x >0. 


Proof: The marginal distribution for E and |x| are 


O D de ige 
@) y= 
P(E<x) = ¢F(O) -1<x<1 ana P(|xX] <y) = 
1 ee Fly) - F-y) O<y 


and the product of the marginals is 


O =o <— yy <0, -o7< X < om 

O -wo<c y<o, -oe< x<- 1 
(tee) EC IX| < y) = < F(O)[ By) > B(ey)] Oy ee 
By eel =a) OS ey 1 <a 


Thus the joint distribution function of E anda |X| will factor 
into the product of the marginal distributions if and only if 
F(O) - F(-y) = F(O) [F(y) - F(-y)] for all O< y which is equivalent 


to the condition in the lemma. 


Remark: Throughout, we will assume that the basic random variables, 
usually denoted by xX or Y, are defined on the same probability space 
and have continuous cumulative distribution functions. Thus the ranking 
procedures to be defined will always be determined uniquely except 


possibly for sets of meaSure zero. 


1) 





4. The Sequential Rank. In the introduction we mentioned the 
possibility of ranking observations as they are taken without reranking 


the previous observations. We make this idea formal by 


Demmi CVO Sec The sequential rank of x relative to X 


aaa ile 
Xn axe. 4 x Tee aL Xk = xX k= ay 2) =o. 5 H where Xk is 
cn 
the k smallest in the set (X, , Xo an ge X J+ 


Thus the sequential rank of xy is always 1, the sequential rank 


of X,, is lore according as X, — Ay Or xy < Ky the sequential 


Famk Of xX USvie2 Or 5 AcCOrding vaca. is the smallest, next largest 


y 3 


or largest of the set (X, Xe y Pale etc. We use the notation Z. 


for the sequential rank of X.- 


Lemma 43.1 There is a one to one correspondence between the set 


Sieenu., possible orderings xX, < A, <=... — A, and the n: 
i, i, i, 


possible sequential rank vectors (Z,, Ziyy ee 3 Z): 


Proof: We can consider (X 





1? X,» eee) x) = (x,, Oe x) 


where the x, are n distinct real numbers and the set ( (x, y Xp press 
sl a 
vectors obtained by permuting the coor- 


Xs )} consisting of the n! 
n 


dinates of (x,, Koy see y x). The corresponding set ((X, p Xp y ee 
al 2 


Xx. )} gives the n! possible orderings. Now define the mapping 
n 


from the set ( (x, p Xp 9 vee y Ky )} into the set ((r,, Poy see T )s 


n 
.th 
fee lyor. = 1, 2, wee fora ly Syosset Oy Se cite ete 


3E a 


ecoudinate of D(X. 5 Xs 4 sauce ) equal to the rank of x, se ge) els. 
al a al iL. 

ib Bs n | i 
Seek 5 Xs Wy «cee Xs i.e. the jth coordinate is r if x. is the 


ae i 
th 


at : 
it a J J 
ic smallest among a ey Xine se 9 Sky The mapping 9 is one-to-one 


and onto. (This is almost identical to part of the proof of Theorem 1.1 
in [2] page 993.). 


14 





By this lemma we mean that if we consider each ordering, say 


a a merene j ee 
Se ty < -_ of a set of observations (X,, Xo» ; x3 


and use definition 3.1 to obtain the associated sequential rank vector 
(Z, Zip vee Zz), the sequential rank vector is uniquely determined 


and moreover the sequential rank vector uniquely determines the ordering. 


Since a particular ordering of xy, Xs BAe x also determines 
Picmenainary rank veeceor (TL, qs; aca) tT) in a one-to-one manner, 


there exists a one-to-one mapping between the set of Sequential rank 
vectors and the set of ordinary rank vectors. 
In order to obtain the probability distribution for sequential 


rank vectors notice that since a particular ordering x. 1 x. <<. iene ees 
AL 2 
x. determines in a one-to-one manner an ordinary rank vector and a 
n 


sequential rank vector, it is enough to determine a mapping from the 
ordinary rank vector determined by the ordering, to the sequential rank 


vector determined by the same ordering. The distribution of (Z, Ze 5 
_ Z_) is then available for a wide class of distributions of the 


basic variables Xy» X soo xX since Hoeffding has given the distri- 


le, 


T 


burson of (T BE 


7 ; qT) i sane 


Consider the indicator function 


1 Ex = y 
X(x,y) = 
OSeit xe wy 


and for X x define the mapping 


ee 


15 





2 i 
P(X, X5, +++ » X) = G, 2O Xo)y ees L MCX 5» Ce 


J 


, Bax, xp) 


Jal 


ak 

ea 

The i coordinate 2, X(X,, =) is equal to the number of X's in 
j=l 

(X, 5 Xe» pons és XJ which are less than or equal to Xs that is, the 


sequential rank of X,. But since X, < x, Hpens SEN a T, Cie Se 


have 


x = 
(Xe ee 


and this holds for all i and j. Hence we have 


ees X52 -+- > X_) = OCT ee = (4 ee 


and ©® is a mapping from the ordinary rank vectors to the sequential 
rank vectors corresponding to a particular ordering of the basic 
variables. 

Let fr. i=l, 2, ..-. , n be continuous, non-decreasing functions 
defined on the unit interval such that f (0) =] - lL) =O for each 


Ee p> anche oe Lane 


i. Denote by A(f,, f,, --. » £,) the family of all (F,, F,, 


ake 


such that F. = ee where F runs through all continuous distributions. 


Now if X,, X ; x are independent and distributed according to 


ig eG 
Ps Fy ; Ro Lehmann has shown in [1] that 
(a) the distribution of the ordinary ranks te ood. 


obtained from ot Xx = 4 x is constant within each family 


De 


mes 





o (f ee f_). This is lemma 3.2. 


a 


(b) the power of any rank test depends only on f., f., ...,f, 


Ne n 
and that uniformly most powerful tests exist. This is Theorem 3.1. 
Because of the one-to one correspondence between rank vectors and 
sequential rank vectors properties (a) and (b) are preserved for sequen- 
tial ranks. The reason for this is that in computing sequential rank 


vectors we are merely identifying different points in n - dimensional 


Space with each possible ordering Xx. = Xx. mere xX. than when 
ils 2 n 


ordinary rank vectors are computed. Thus the probability associated 
with any subset of ordinary rank vectors can also be associated with 


a unique subset of sequential rank vectors and we have, analogously as 


ell, 
; , Owe O 

Theorem 3.1. Given n functions ts ts) os es to and any 
sequential rank test of the hypothesis H: cP Fy; ; F ) = 
S -, eee hs cs) (i.e. a test based on the sequential ranks), the 

O , 

power of this test depends only on 7, i So unG Tet, seer 
CLE Foy tee y ie and (F', Eco & F') belong to the same class 
Ce Poy tee ie) the test has the same power against these two 


alternatives. Furthermore given any class of alternatives K: 


Gita: > Fo) er (te 


? 7 5) 5 i) there exists a uniformly 


most powerful test based on the sequential ranks for testing H 
against K. 
When x Ky» oe 


the sequential ranks are independent with distribution 


A are independent and identically distributed 


Vere Ist = 1a bea le Oy et einen a i=l. 2 oh 





A proof of this is given in [2]. We see that the mapping defined in 


(3.1) takes the vector of dependent ranks (T eee a ope ) iter ae 
n 


1 
vector of independent sequential ranks (Z, ; Ze ns Z) Thus 
according to Theorem 4.1 and the discussion leading to it we Lose nothing 
in the matter of hypothesis testing by considering sequential ranks 
instead of ordinary ranks, and in fact when we are dealing with inde- 
pendent and identically distributed random variables we find that the 
sequential ranks are independent. 

Since there is a one-to-one correspondence between the ordered 


observations and the sequential rank vector, the distribution theory 


for sequential rank vectors is also completely specified by 


n 
ao ee ; el I} a", (x. ) 
1, 1, ma wll a2 1 Se 
<X, Seer S 
15 ~ 


ae 
rs 
/\ 
rs 

lA 
x 

PS 
T 


em < 
iL 


Bes = food [Pac, (Fx) 


 <X, <<... < KX, <M J J J 
1 ‘n 
n 
= fof eae (y, ) 
Te cio aaety 
O ye ee ee ee 5 Jd 
= 1 -"’i - 
# & n 
where Se F(x, ) and the x. are assumed to be independent in this 
J J 
calculation. Let f = (ae i ? i) and write 
P(X, < X,S--- x) = P(f). The distribution function for the n! 


vectors (Z,, Zs 


possible permutations of the components of f. In order to determine 


; Z_) is obtained by computing P(f) for all 


the marginal distribution for Z. we notice that a. =k df Only a 


18 








Xx. is the ns smallest among the first i observations, and we get 
(355) P(Z, =k) =) P(t) Ce eae eee) 


where fr. iS ohne rae coordinate of f and the summation is taken 
over the (i-1)! permutations of the coordinates leaving f, fixed at 
the ee coordinate. 

For the special case where the Xx. are taken to be identically 
distributed, we can take f(x) = x without loss of generality, and it 


is easy to compute (3.2) and (3.3) to get 


(Gr) P(f) =1/n! 


and PZ =k) = 1/1 k 2.1 2, ieee. = eee 


yielding the independence of Z Z, es a as noted above. 


ee 
Another special case, to be used later, is when the r. are taken 


a. 
to be the Lehmann alternatives, introduced in [1]. We let F, (x) = F *(x) 


a. > 0, and in this case a straight forward computation gives 


gl 
j[ a 
aE 
i 
55) CGS eS co Sk) = 
I (% « 
jy Val 


By relabeling the X's, the probability of any order of the X's can 
be found using (mee giving all the values needed in (3.2) to specify 


the distribution of the sequential rank vectors. 


We, 





4. An Application of Sequential Ranking to Hypothesis Testing. 
In the nonparametric, fixed sample size, two sample problem, it is 


assumed that there are available two sets of observations {X x 


ee ai 


‘ xX and (Y, > Yo» 


distribution. The problem is to test the hypothesis that the distri- 


P YJ each set from some probability 


butions are the same, against the alternative that they are different. 
Usually the alternative is more restrictive as when only a shift in 
location is considered. In this section we consider the nonparametric 
two sample problem as a sequential problem rather than fixed sample 
Sac. 

Let X. ite eee = ctehalel 5 j=l, 2, ... be independent 


random variables and assume we wish to test 
H? G=F “against oe =o) 


where F is the continuous cumulative distribution of the X's and 
G the continuous cumulative distribution of the Y's. We propose to 
use the sequential probability ratio statistic based on the sequential 


ranks and we can assume the observations to be taken alternately as 


XK, Yi, Xn Yon. ore x vy 
Let 73 = (Z, , Zn 3 ae Zn) be the sequential rank vector based on the 


first N observations and write Py(z \/P(Z ) as the sequential 
probability ratio, P. referring to the alternative to the hypothesis, 
Po to the hypothesis. 

Under the hypothesis p(ZN =z) = 1/N! and P (Zz) = 1/N! Under 


z) by noting that each outcome 


N 
the alternative we can compute P(Z 
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vector 2Z corresponds, in a one-to-one manner, to a particular order 


of the X's and Y's. For example 


7? [ai 2 hea = x any 


mol iew ll) 0k. ay Ga S 1 a 


Zz 1 alk, 


Each ie in turn corresponds to a vector ((F, G, F) or (F, F, G) 
as in our example) of F's and G's meaning that the observation 
appearing in the ie smallest position in the ordering of X's and 
Y's has the distribution F or G according as F or G appears 
as the a coordinate of the F, G vector. Thus to compute 


p(2N =z) for all possible values of 2Z we need only compute 


Ores Ws soos UL 


where U, is an X ora Y according to the outcome. In particular 
when f is a continuous increasing function on the unit interval with 
f£(0) =1-f(1) =0, the probability distribution is constant for all 


continuous distributions F and depends only on f. In fact we have 


‘pol i SOR re) 


aS oo SS ae 


a 
GS 
A 
GS 
/\ 
G 
I 


oe i an Gye) 
ee eae 


Sau, < 


by letting y, = F(t, ) where ees) = F(t, ) when U, =X, and 
ie NGE = aCe.) when U, = ¥. 


Baht 








In the special case of Lehmann alternatives f(x) = x”, a>od 


and by (3.5) we get, for N even, 


bt 
}- 
2) 
= 
II 

PS 


where 


ee rT 
TG Ts) a if VU, be 
x - i i 


and the probability ratio reduces to 


Py(Zz) eo a? 
P p (2M) ) tl ( § T(E) 
Pa 2 
A similar result holds for N odd. The vector = = (A), Ay» oun es Ay), 


corresponding to the vector of F's and G's determines p(z =z) 
for [(w/2): outcomes z out of the WN! ypossible. We can compute 


the probability ratios at each stage using the following relations: 


N-1 
eae 5 N odd 
N a 
iC A,) 
P ve a 
aE aes 
Cre) oN = pF 
O 
_ eV N even 
A, 
CS, 44) 
oe 


ea 





2 
(N+1)! a 
ally a N cee 
[2 As) | (a+ da) 
I ae isz-2 
ere) 
P 
lobe 
i PO 
O 
ay N even 
Te N Al 
(2a) Ws dy) 
j=l ea ah jal 


( i=l 


where Z = sequential rank of tye ; 


B 


or Xo » N even. At the N + ee observation 2Z is determined 


e 
emanit 24 =k, the N + ila observation came between the k - 1” 


N odd, and Z = sequential rank 


t 


and the gs smallest observations of the preceding N observations. 


Thus a = (A784, => > AL eee A) where 7 


the N + es observation is an X and A* =@q if the observation is 


a Y. Using (4.1) and (4.2) and Z we can pass from S, to S),, as 


the observations are taken. for example Sy =] 


2a/lta if X, <Y, © i (ies) 


2/lta dere | ae = (a, i) 


A 
bd 
ih 

wel 
| 


29) 





re | ee ou , 
dake ee AL = Vila, ol 
(ita) (2+a) 5 re 
<< 
x, t < xy 
| 6 Y, <5 44 5% _ 
a": <GiaGte) “gg ee 
| 1 2 1 
| As 4,5 4) 
| _28 if a 
(+a) x, = xy = Y 2 es = ea 


We noted before that under the hypothesis a=1 the sequential 


ranks Z Z_ are independent. However, when a #1 we do 


akg Za» eo @ 8 > N 


not have this independence property. Consider the case N = 3 where 


we observe X a xX in that order. The possible outcomes are 


ae lie? 2 
Ordered observations Sequential ranks Proebaba lite, 
+ 
ao Cl ene) a/2(2ta) 
+ 
a Ghee) a/2(2+a) 
+ + 
xX, < ¥, < X, Gia 5) a/(1+a)(2+a) 
+ 
ToS NBG (rie a/(1ta)(2+ta) 
i ee Clee) 1/(1+a)(2+a) 
i i. 2 
Sas x Gl, 2.62) 1/(1+a)(2+a) 


and the marginal distributions are easily computed as 


eh 








P(Z, =1)=1 P(Z, = 1) = a(3ta)/2(1lta) (2t+a) 
P(Z,, = 1) = "1/ ite P(Z, =f = (2tata”)/2(1+a) (2+) 
CaS 2a ita P(Z, = 53) = 1/2ta 


Now 1cCae Zi, Z5) = (1, 1, 1)) = a/(lte)(2te) ana ae) 


P(z = - _ a dta : 
( D 1) 15 (2, 1) (1ta)(2ta) 2(1+ta)_ and it follows that 


ae ZA 3 a cannot be independent unless a-=1 since independence 


of 2, 2, 2, implies (3+a)/2(l+a) = 1 which in turn implies a = 1. 


Thus we have 


Theorem 4.1 Let X., Y 


a Be a 


7 


W? ty be independent random 


variables with x. distributed according to F and Y, distributed 


ay 


according to ie a >Q. The sequential ranks based on such a sequence 
are independent if and only if a=l. 
As an illustration of the sequential probability ratio test based 


on the sequential ranks consider the data given below. 


= 3.926 Xi 0 = Oe “ 4.70 Yio = uellisie 

x, = 3.45 Koy = Soir ae 4.15 oa = 4.29 

X, = 2.00 Xig = 2.94 Ys = 4.55 eae 1.74 

X, = 2.28 X15 = 5-90 Yi een Yy5 = 2-17 

Xe 5.494 Xi = 2-18 Y, = 2.13 Maye eon 

Xe = ee X15 2559 Vea 4.686 Y15 = 4.689 
x, = 2,302 Xi 6 = 2.74 Yo = 2.68 ‘16 =e oi 

Xg = 3.02 Xi4 = 3.492 Yg = 2.36 Y17 = any 

Xo = 20 Xig = 2-70 Yo = 3.93 
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The data is taken from Table 600A page 600 of "Statistics, A New 
Approach,’ W. A. Wallis and H. V. Roberts, The Free Press, Glencoe, 
Illinois. If we assume X has some Pee ioue distribution F and Y 
has F’ as a distribution then 


Px<y)=/ ry) ay) = 7%. 


Suppose we consider a=4, P(X < Y) = .8 as the alternative to the 
hypothesis a-=1. We take as boundaries for the sequential probability 


ratio test 


and if S,<B we accept H: a-=1, if S,; >A we accept K: a = 4 


and if B<S.<A we take another observation and compute 5 


N N+1L? 


repeating the test. Using the computational formulas (4.1) and (4.2) 


we get 
Ss, = 1 Ses on 
Se hue Se nee 
Sz = 2.0 S13 = 764 
S, = 3.2 Si, = +2986 
85 = 4.15 Sis = 467 
S65 = 6.05 Sees 22 34 
S = 8.75 Sy = .168 
Sg = HO. S13 = 242 
Sy = 3.31 Sig = 138 
S56 = .809 Saq = .0288 
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and since S,, < .0526 we accept H at the 20°" observation. 


Notice that even though the probability ratio Oy is written as 


a function of the sequential ranks, in (4.1) and (4.2), it can also be 


computed as a function of the order configuration. By this we mean, 


for example, the order configuration 1 a 1 stands for x, < Y) <i X, 

<4 , 
or XK, Y, <a Xx, and, <a 1. I> stands ster yy oS Xx, < x, or Y = X,, < x) 
ach order configuration determines a value of Oy as a function of a. 


It can happen that for some value of a #1 and two different configu- 
rations, ory takes on the same value. As an example consider WN = 6 


and the configurations alllaae and tlaaatltl. The denominators 


in 96 for these configurations are 


g(a) = ala + 1)(a + 2)(a + 3)(2a + 3)(3a + 3) 


1(1 + a)(1 + 2a)(1 + 3a)(2 + 3a)(3 + 3a) 


g(a) 
respectively. For a= 1/2 and a =2 we get 
g, (1/2) = g,(1/2) = 945/8 and (2) = @,(2) = 7560. 


Let c(t) be the number of different configurations such that 


Sy = t. We have 





(4.3) P(S, = t | a) = 7 


i 
% :,) 
jal ° 


i=L 


where the Ba correspond to any particular configuration making 


oH 
ayo (a. -1 or a according as X or Y is inthe j; place). 


ef 





(4.3) follows because any two configurations which make S, = t have 
the same probability under the alternative to the hypothesis. Under the 


hypothesis 


en ye eee) 
may re ee fa ey 2 SEO BI)? ole 


N 


In (4.3) and (4.4) [x] is the greatest integer function. 


In Wald's sequential probability ratio test the approximations 


1 - B p 
ar Dre 








end B= are valid when the probability of termination 
of the test is 1. These inequalities were derived under the assumption 
that the basic sequence of probability ratios was determined from an 
independent sequence of observations and that the sequential probability 
ratio at the as observation is formed as a product of independent and 
identically distributed random variables. Under the alternative hypo- 
thesis we have found that the sequential ranks are not independent. 

Thus we must now show that the test terminates with probability 1 in 
order to interpret @ and 8B as error probabilities. 


It is enough to show that the test terminates with probability l 


considering only WN even. For N even we can write 


N 


-1 -1/2 1 
(cls) Oy an \| a i 2, ar 
isl i 
= -1/2 <N N 
and define A, =F dA. with y! = & A, and 2, = log y . We 
1 a 421 J al al al a 


consider first the case where the null hypothesis is true. Ay» 


A 


ne, Ay are dependent random variables with 
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(4.6) P(A. = 1) = P(A, =a) =1/2 





giving B(A,) = have 


I 
NM 
rxj 
O 
r 
fe 
“hh. 
Cy 
= 
a) 


i 
nN 


L 
JE A. = di A. = aan =— =_- — 
( ; Mee i) P(A, ap A, a) mn 


0. 
}4 


P(A 


= ee 5 Soe 2 
, 71, A, = a) = RA = ay A. = De Ta 


and a simple computation gives Var(A,) = (=) and Cov(A,, A.) 
Bs 2 ie ue 


l-a Be ot OY 
= as) . Also B(Y,) = / E(A, ) - 2-2 ana 


J 


ey) a” ) a i 
Very) = Var(A, = Var(A a Cov oe 
= 8 2 2 (A) + 2 . L ova, A) 


I 


i 


-1 2 2 
st GS) +? - a get GS | 


1 N-i 


2 
) N- 1 


| 
oe 


L-a 
( o 


bo 


and notice that Var (x) is decreasing in Ji as = eee ee 
itl <a then lL < A <2) ciad a2 2 i < all? if 7 a chen 
aue < YW < aly e 

In order to show that the test terminates with probability one it 
is enough to show that s >o in probability. Thus for arbitrary 


positive B we show that 


Be, 








ete (ah <1/B) = lim P(log S— 


y < 108 1/B) 
N-> © N—-> © 


Let K = log 1/B and use Chebyshev's inequality to get 


p(y asx) r( yah. y B(al) <x - Y at2 %)) 


lA 


P( ya. Let C4) al) SS Tee Exc) 


i=l 


N 

by taking N large enough to make K - a QO. This can be done 
i=l 

since - is bounded, and bounded away from O, we have 


ae 2 
ah a 2 ui 
= Log Y = Oo a 7 (¥5 : oy ( N , 
a ee 
a 
=1/2 Wh 7 
where do = E(Y,) - §_+4— > 1 and c is bounded away from 0O, 
and further 
N N-i 
E(Z.) > log Ne - Cc T(N-1) cee 
veal 
», £(Z;) = N log >, - O(log N) = O(N) >O. 
i=l 


Thus 
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and 


<j 
9 
Fy 
7 
Be 
N 
Hs 
eo 
II 


N 

yy Vara) + 2 y Cov(zy 7) 
& il LS, aie 
i=l iy 


N 
Y var(Z.) + ot oy (var (z¥) var(25) 1/2 
izil : i<j i J 


lA 


Now, expanding log Y, in only two terms 


Var(Z.) E(log a, - E(log aye < E(log i - log SP 


yN 2 


r 
ss ar ) Se Var(¥,) = ¢ L(N-1) 
at 


N-i 


an’ i(y-2) 


is decreasing in i. Now we can write 


/\ 


N N 
var ( yy _ < O(log N) +2 c ) (N-i) Var(Z,) 
i=l * = : 


O(N log N) 


and finally 


N 
p( Yat <x) < O(N log N) +0 as Nom, 
5 ae = Ne 
i=1 O(N) 


at 


N 
Since log S = ) zs we have S.- 3 in probability, and when the 
= 


N 
null hypothesis is true, the test terminates with probability l. 


pul 








More generally now consider 


-l -] mall Z 
4.8 Ie = rie — = - = = 
(eee HA <8 BS) P(KD = ys leg 8 - uy SK, - By) 
-1 -1 -l 
where K, = log A’, iC log B and Hy = E(log Sy ). For large 
enough values of N 
ie a 
U9) 1 ! Var (log os) Y 
ll “ae 
Py S 
-l 
A Var (log Sy ) 
P ae > = i =: ico 
(| log Sy Hy > K, Ly) 5 it Hy SS 
(K) - Hy) 


dné test will terminate with probability 1 as long as F.—> 0, 
and this is independent of the true distribution of the Y population 
since the inequalities in (4.9) were obtained without reference to the 
distribution of the Y's. In particular we found that when the X and 


Y populations are identically distributed, uh, = O(N) and 


-1 
N 


The method just given to show that the probability of termination 


Var(log S.) = O(N log N). 


of the test is one is not satisfactory for all alternatives since the 
verification of condition (4.9) is difficult. We now consider a better 


approach. As before take N even and write the probability ratio as 


al 


N ~ 
-1 121s 

(4.10) ee I {a ze A,} 
=] j=l 
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In order to show that the probability of termination is one it is 


-1 
enough to show that N Mores 1h converges to some non-zero constant 


N 


since for fixed boundaries A, B, the equivalent formulation 


iL al. al iL - 
Ty lon 7h 5 = N log Ty << N log B 


il 


will terminate with probability one, provided nt log T, converges in 


N 


probability to some non-Zero constant. 


et ren = N and let 2 a 


De otee ay be the order statistics for 


leg 
the combined sample. Define the empirical cumulative distribution 


functions for the X's and Y's as 


{number of X's < t} 


F(t) = = 
{number of Y's < t} 
G tt) - n 
a 
Since ) A, = {number of X's in Z5y Zyy vee y Z.} + a{number of 
j= 
SE aleal Z 3 Za, cae 2} we can write 
il : n n 
= ) A= > PAZ oe aro a 
it fell J) 2 Minewee i ‘ale a 
and 
= log T, = - iz log a - log 2 + log N - = log N: 
ny °S “nN 2 N 


+ 
lh 


unre 


& tos (F(Z) +2 6, (2,)). 


D2 








Since lim (log N - = log N!) =1, we have 


N—->o 


N 
: =k it 
lim N~ log Ty = log e/2Va + lim 7 ) log (F(Z, ) i G_(2,)) 


N —> © N > © steal 


[08 (Fix) +a G(x)) (aF(x) = aG(x)), 


—=00 


log e/2Va a 


MOIR 


the latter limit following from a result of I. R. Savage and 


J. Sethuraman communicated to the author by Sethuraman as 


Theorem (Savage-Sethuraman) Let X_, K. ... X,¥_, ¥., 22. ¥ 


17 2 aoe n 

be independent random variables where the a are distributed according 
to the continuous distribution F and the Y, according to the contin- 
Z5 9 


statistics of the combined sample and let ay and Ge be the empirical 


uous distribution G. Let Z wees Za (N = 2n) be the order 


med 
cumulative distribution functions of the X's and Y's respectively. 
Then 
a ian 
N~ ¥} log (F(Z, ) + a G(2,)) a5 [ log(F(x) + a G(x))(aF(x) + aG(x)) 
steal “—{0"0" 
in probability. (see [10]) 
In our case G=F or depending upon which hypothesis holds. 
However we will consider the entire class of alternatives Fo lee 19, 


which could hold. Let it log ue > Li(p). Then 
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00 b 
L_(b) = log e/2va +s f log(F + ai) d {E+ aF) 


=—0O 


i Oo 
a5 f log(F + Be) a({l = ie) 
(2) 
1 lta 1 ar b 
= log e/2va rs LOR Recica at [ log(t + at )dt 
a 2a 
0 
aT! oe 2 = log a eae (1+a) pau f° log(1+ eed 
S 2 2a he 2a J laa 


a 
b-1 
iMate, 28 bye een esa) ‘i log Mierat jdt decreases as b INELeases, ~andetiic 


@) 
L_(b) is monotone in b, decreasing when 1< a, and increasing when 


cle <u 
Under the null hypothesis b=1 and 


2 1/2 


i (1) = log “= 4 2 O° for deo 


Under the alternative hypothesis b =a and 


se : /2 (a-1)° 1 - 
L (a) ace C 7 2 l-a 
OQ: “a. re 


dt. 


In order to show that the test terminates with probability 1 we must 
have L (a) #O for a#l. In fact we will show that L fa) 26 
Horeca + 1. Notice first that it is enough to consider 0 — 7 <7) 


since 


72 








12 . a7? (72 : 1)? “ 


i) 
be -— =, = > an =r ewe © 
a 2 0 -1 " ,i-l a 


L, (1/a) 


1/2 1/2 2 i 
ee a, + a : ac) f 


I 


-1/2 1/2 2 ee) vee: 
ge Cee See ere [ ce 
O 


Te Seles 7 (a-1)° : oa ee Pe: (5) 
6 2 0 l-a a 


We can write 


aL (a) 


2 il 
OW ast 
i 2 il 2 
- f —{acl)" : at - | (ei) at 
O ha + (a-1)°t O att 


1 
il 1 
(a-1)* | + > eee : at 
O ee (ii ee ela 


1- O 
and we wish to show that att ~ < 4a + (a-1)> t for O = t= 1 and 


O<a<l1l. Define 
Hla) co), =) Seer) ee 


and notice that 
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> Cay oom Gers) ie ae << ee ten) = (ash) <0 


-(atl) . 6 


—~ = a(l-a) t 
Since h(a, 0) = 3a, h(a, 1) = a(a +1) we may conclude that 
h(a, t) > QO, which makes the integrand in aL (a) negative as was to 
Dem oroved. 

We have shown that the sequential test terminates with probability 
one under the null and alternative hypothesis and moreover the test will 
terminate with probability one when the Y's are distributed according 
to EP for b>O except possibly for only one value of b. This 
follows from the monotonicity of L_(b). 


We also remark here that for a fixed sample size test of 


X~F, YF 


K: 


against Hy: Xo eee a 7 a Oo 


using ranks of observations, the Neyman-Pearson theory would give a 


most powerful test of the form 
-1 
accept H for SS, >K. 


i. -l iL 
An equivalent test would be to accept ae la V log Sy a log K. 


Assume a>1 and let L(b,) =©O. When 


oT 





lim P(= log ee = 


lor) 
N 
ia 


al la 


Loa P(= Wee eee = Sa) 


N—-o© : 


=a 


and thus for a test of the composite 


Vs ees i, a 


O 


=] ales 1 ~ Fe b 
=l she AC Fe db 
hypotheses 


OE Ore =>.) 


against eh xX F, Y~ Fe bata) <b 


it 


mae Lest is consistent. 
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2+ The Signed Sequential Rank. We now extend the ranking procedure 
defined in Section 3 to include the sign of the observation. This corre- 


sponds to the signed rank statistic used in fixed sample size problems. 


Definition 5.1 The signed sequential rank of x relative to 


vy A 


re x is the product of the sequential rank of IX | relative 


7 


to IX, |, Bal: eben, IX | and sign (x), where sign (x) =1l if 


fae OO end Sign (X ) =-l if xX <0. 
i n n 
In the case of sequential rank vectors there are WN: points in the 


x xX and in the 


sample space corresponding to a sample X pr see yg Xp 


i bigs 


case of signed sequential rank vectors there are ZU N.; points corre- 

sponding to the same sample. Of course if the basic variables (the x.) 
are positive random variables (or negative) the signed sequential ranks 
are equivalent to the sequential ranks. 

We found in Section 4 that when the basic random variables are 
independent and identically distributed the sequential ranks are inde- 
pendent. This result does not hold in general for signed sequential 
ranks and so we now determine a sufficient condition for this result to 
hold in this case. 


X 


Let X >? 


»+- » X, be independent and identically distributed 


ae? N 


random variables and let a, = sequential rank of IX, | relative to 


IX, |, IX, ee IX|, E, = sign (X,) with Y, =E, Z, i =1, 


Peon. , MN. if F(x) = P(X, <x) “satisfies themecndiciem in lenges 


it 


E IX, |, IX1, ee Ix, | are independent and it follows that E, 


and 4. are independent. Thus we get 


oy 





OC eC 2 i) Sy a) 
ew) = (1-F(0)) 1/i 
cere =) > Co a) = ay = -1) 1 CED, 
= ECO) a 
eam Le) ae el, 1 2 j) =1/i follows 


from (4.4) 

We will now show that the condition given in lemma 2.2 is a 
sufficient condition to guarantee the independence of the signed 
sequential ranks. 


Theorem 5.1. If X X X are independent and identically 


1? Xoo oct gs By 
distributed according to F(x) where F(-x) = F(O)[1-F(x) + F(-x)] for 


pulleex — O then the signed sequential ranks 7), 1 ; 1. 8take 





1? ed N 
independent random variables. 
Proof: Let ay Lyx cee i) be an arbitrary outcome vector for 

(Y,, oy P Yn? and let k be the number of positive integers in 
Coe Loy see a) We have 

lee _ 4) - LF) F tro)" 

m m Ne 
=|) 


from (5.1). Each outcome vector corresponds to a particular ordering of 
the X's, with N-k of the X's negative. The absolute values of these 
N-k X's have a particular ordering among the positive X's. So each 


outcome vector is equivalent to an event like [0 < Ey x, < €, x, < Vani 
Ile oO 

< Evy x ] where k of the €, are 1 and N-k are -l. ‘The distri- 

N 


bution function for -X is 1-F(-x) and using F(-x) = F(O)({1-F(x) + F(-x)] 


we have 


LO 





dgd{1-F(-x)} = -aF(-x) = a4 aF(x) . 


Hence 
PY, = 41, ¥ = 4p, » Vy = Ay) 
\ N 
= 12 01a eee Gaur < €. Zz | 
a vl Jy wy, : Fe oy (v5) 
C= yy < <Y¥y <” ie Ji 


II 
=e 
+! 
1 jay 
ANS 
© 
eC a 
7 
x 
c 
ise 
rma 
ss 
r< 
si 


Oy ee 
N-k 
= rey PO <x) <= Fee 
1-F(0O meee: Jy 


, | F(0) - PGs X., for all i ) : [r(o) P* [1-F(0) 


1-F(0) N: N: 
N 
Thus Be = Tp aay 3a, Gy = iy) = nate = is) establishing 


the independence. 

Remark: In the proof of the theorem we have assumed that F(0) f Ls 
ier oO) = 1, the X, are negative random variables and the signed 
sequential ranks reduce to {-(sequential rank of Ix.1)), which are 
independent. 

Mie condition F(-x) = F(O){1-F(x) + F(-x)] forall x 2>0o jis 
satisfied by distributions of positive, negative and symmetric (about 0) 
random variables. A larger class of distributions satisfies the 
condition. If we consider all measurable sets Ac[0O, ») and define 
soe = X € A}, then the condicion 


Pr{X e€ A} =k Pr{X e€ - A} Is 2) Ol © Fala = oe 


is enough to insure that F(-x) = F(O){1-F(x) + F(-x)] for all x >0, 
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des) 
F(O 


F(-x) = F(O)[1-F(x) + F(-x)] for all x >0O. On the other hand, 


since taking A = [0, o) we get k = and taking A = [0, x] we get 


starting with F(-x) = F(O)(1-F(x) + F(-x)] for all x >0O we get 


ar(x) = 7S) af-7(-x)) 
and 
: SenOn. _ 1-F(0) 
Eaee A) = ioc = “B(6 a d{-F(-x)} = FO) Pr{X € - A} 


We now consider the asymptotic distributions of sums of signed 
sequential ranks based on observations from a distribution satisfying 


x 


the condition in Theorem 5.1. Let X a XO be independent 


ae? 


identically distributed random variables with common distribution 


function F(x) such that for all x >0O F(-x) = F(O)[1-F(x) + F(-x)] 





holds. Define a = signed sequential rank of xX Using (5.1) we 
get easily 
gta cel 
E(Y ) = (1-2F(0)) —3 


(5.2) 


var(¥_) -(3- (a-2n(0))" ) 42 ‘és 


oO 
i 12H) 
“(3 ; (gro) 


When F(x) satisfies the condition of Theorem 5.1 the signed 
sequential ranks are independent, but not identically distributed, and 


forming the partial sums, S| = 


Y. we have 
n a 


It ae) 
04 


4o 





B(S.) = (22800) on (212-¢H(0))) . 


Var(S ) ) n(n+1l)(2n+1) ue (2-0-2710) \ n(ntl) 


: Gaon } . 


Pas clr a = Vvar(S_) it} follows that for 


(5.3) 


New tor € >0, k=1, 2, 


large enough values of n 


(y - B(y,))° aR, (y) = 
ly-E(¥,)| >€ 9, s 


because the range of integration becomes a set with zero probability 


Since Yi is bounded according to Bees k and o> oe yo! 2 Thus 
women = 0 the integral is zero for all kK =i, 2, 3... , 0 one spy cn 


Lindegerg-Feller Theorem it follows that S is asymptotically norvwal. 
If we normalize the signed sequential ranks and then consider 


partial sums we get 


ee E(Y,) 


S* = 
oy (var(y,)P/ 





and 
| epee e ta) oy 3 D 
e< < = Jone 
= 7. 1/2 ie = 
Tes iat Ve | (x, a° +4 + ,) (, +4/i + a/ i) Voy 





as i-—- oo where oO 


sequential ranks are uniformly bounded and by the bounded Lyapounov 


= 2 = 1-27(0))° Hence the normalized signed 
Theorem S*//n is asynptotiecally distripuved as a unit normal random 
variable. 

As was noted in the introduction, some statistical problems are 
concerned with detecting a change in the distribution of a sequence of 
observations obtained from some process. We now consider the case where 


X 


Ase 


in the basic set of independent random variables {xX 


ili Ww 


the first m are distributed according to F(x) and the remaining 
N-m are distributed according to G(x). As before let Y, denote 


the signed sequential rank of Xi: Since each possible outcome vector 


for (Y,, Yop tee y ed corresponds to an event of the form 
[O< 6.) XX ” << see ee | 
1 i, a i, N iy 

where ¢€, = +1 and (i,, sas, a? is a permutation of 


(1, 2, ... , N), the joint distribution of the signed sequential ranks 


is obtainable, in principle, from 


(5.4) Pos €, no. < €, 2 ono SS Se x P(F, G, ¢) 


where € = (<,, Eo» ae In general (when F #G) the Y. are 


Ww? 
not independent. For example if we are sampling from an unknown distri- 
bution F(x) and we wish to detect a change in distribution to F(x), 

a >l1 (a stochastically larger distribution) where F(O) = 0, we lose 

the property of independence. In this simple case signed sequential 


ranks and sequential ranks are equivalent and taking N = 3 with m=l 


we have 


uy 





7 _ _ —e 7 7 1+4a 
ie se) ode PY, ee aS © Ee, 2 os 2(1+a) (1+2a) 
dk 
P = = = = 2 
and Oe vee | Y, 7) SIGE In general, for a>l 


ae alk 14593, 9a 
(ida) # iva Ba) (awa) * ATA) QP 
Since there are cases when the signed sequential ranks are inde- 
pendent we now determine the marginal distributions for signed sequen- 
tial ranks in the case where a change in distribution from F(x) to 
G(x) occurs for arbitrary continuous distributions F(x) and G(x). 
Let xy» OF oe 
1<ig<m distributed as F(x) and X,m+1<i<WN distributed 


xy be independent random variables with Xx. 


as G(x). Take N=m+n, and let Y, be the signed sequential 


rank of X, and H(t) eee tS GLU rourlos finer tener ce te" 


order statistic from the set (1X, |, IX, 1, Neneh og? Oe It is enough 


N-1)° 


to determine the distribution of ty" Using lemma 2.1 and P(|x,| < x) 


= F(x) - F(-x) for x >0O we get 


(525) 

N-l i n- . : m-j 

H(t) = YoY GGT CRC) - F(-t))° (1 - F(t) + F(-t)) 
i=k j=0 9 
(a(t) - G(-t))7-9 (a-c(t) + G(-t))2-P7d-> 
te 

Now let Z, de the Po onder teva et elmer (1X, |, IX I, coe 
par |) Then 
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PCY, = 1a 1 (i xy < Zz.) = E(G(Z, )) - G(0O) 
- |e dH, (t) a 1ghee)) 


=1 - Go) fo H(t) aG(t) 


Also for 2 Se = 


PCY, =k) = P(Z, Sue Z) = E(G(Z, )) Z E(G(Z, 1 )) ; 


Now E(G(Z )) = : G(t) dH (t) =1 - : (t) aG(t) and 
Or af ote) ace) 2 - fa, 


P(Yy = k) 3 (4,_,(t) - H(t) ac(t) 


k-1 ei 


FS MOS | le - re) Gere) + r+) 4 
0 


jro ° 
(a(t) - G(-t))* 29 . (1-4(t) + G(-t))7 =" ac(t) 


For k=WN we get PCY, =N) = PZ, 4 = Xy) =l- E(G(Z, 4 )) 


10.0) 
-/ Hy y(t) dG(t). For negative values of Y, we can calculate 


PUY. aa). Ca Se OS Z, ) in a similar manner to obtain finally 


flops =< k <N = 1 
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P(X, ae eGo) - fae) aG(t) 
O 


P(Yy =e =" GO) + fe) dcl=e) 
O 


I 


Py 3 eee fe (F(t) ~ F(-t))9 (2-F(t) + F(-t))™9 


(5-6) -(G(t) - G(-t))*2°9 (eect) + G(-t))2-**F aa(e) 


P(Y, = -k) = - (ea al (F(t) - F(-t))2 (1-F(t) + F(-t))™9 


- (a(t) - G(-t))* 49 (1-c(t) + o(-t))™**9 acct) 


ae a) -/ ee) aGc(t) 
O 


Py =< = - f Hy ,(t) ac(-t 


The equations given in (5.6) can be written in one formula as 


es) = Von 1- =n (F(t) - F(-t))? (1-F(t) + F(-t))™4 


G7) | | 
- (G(t) - G(-t)) 779 (1-c(t) + @(-t))***9 ac(et) 


Moers e = +i] and k=1, 2, ».. 4, Ns Verifieation thay (5.7) ~edue-> 
to (5.6) in the case k =1 and € = +1 can be accomplished through 


the following result 


47 





N a 
Lemma 5.1. > De 


: . ne ae 
Ma_;) p’(1-p) 9 q’ 2(1-q)" "9 = 1 (N = min). 
i=O j=0 


ee . ae Par: 
Eeoor: het ai, 5 (5) (4-5) pi (l-p) 9 gu 9 (1-q)"" 79 sand recall 
the convention of (>) =O if b>a. Instead of summing as indicated 


we sum along diagonals and get 


N i N N-£ 
Pes Lays 
i=O j=0 £=0 j=0 
2 nae m,;n J m-j 4 n-4£ 
= dy 2 C)(,) pi (1-p) Yq” (1-q) 
£=0 j=0 
= n L n-£ ee m, 3 m- j 
= ), (,) a (1-a) 2» (.) p*(1-p) y 
v=O Jj =O 


II 


N-2£ ; 
a7 a £ n-£ m m- 3 n 
id) © ley ay p'(1-p)" 9 since (7) =0 £>n 
£=0 j=0 


Sincesso = 2 < 1 the upper limit in the second sum is N>N-2>N 
-n=m implying m<WN- £4 and making the second sum always equal 


to 1. Using the binomial theorem a second time gives the result. 


ifegumie@ep = M(t) - F(-t), q = Gt) = Gl=t) Swe cam write 


H, (t) =1- {1-p]” ea to complete the verification. 
Using Lemma 5.1 and (5.7) we can compute the characteristic 


monet ion. fOr ty as 
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iu ty 


(5.8) e(u) = Efe y= 


ae iu nt iu, 
Jet trate) + ale) 1" pte) + ley ef” ate) 


co 8, ae eae a 
5) e[1-q(t) + a(t) e-"] = [1-p(t) + p(t) 7") ac(-t) 
O 


maemo pit) = F(t) - F(-t) and q(t) = c(t) =] Got 


Differentiating (5.8) and setting u=0O we get 


(5-9) E(Yy) -/ (1 + (n-1) a(t) + mp(t)) afG(t) + G(-t)} 
O 


(210) 
B(ve) = 1 + (m-1) 28+ 5) + [ (ample) + 2m(n-) a(t) P(t) 
0 


+ m(m-1) p°(t)) dq(t) 


The marginal distribution of Yy» equation (5-7), holds for 
arbitrary continuous distribution functions F and G and thus (5.8), 
(5.9) and (5.10) are the general expressions for the characteristic 
function, mean and second moment of the ty Thus to generalize (5.2) 


to arbitrary continuous distributions F we let F=G in (5.9) and 


(5.10) and we get 


a [o@) 
Sa) a — hee 1), (3 - G-(0) -f G(-t) ac(t) ) +1 - 2G(0) 


(sso) B(Y,) = . 1 


Mls 


Jk 
2c 


49 


—_, 





6. An Application of Signed Sequential Ranking to Process Control. 
As stated in the introduction, in the process control problem we wish 
to determine a procedure which will determine when a given sequence of 
random variables changes from being distributed according to F(x) to 
a different distribution G(x). In particular we will consider the case 
where F(x) satisfies the condition of Theorem 5.1 and changes to G(x) 
which also satisfies the condition. Inasmuch as the distribution of the 
signed sequential ranks depends on the parameter F(0O) we will of 
course require G(0) # F(O). The procedure described in this section 
is still applicable to cases where G does not satisfy the condition 
in Theorem 5.1 but we do not have exact results in such instances. 
However empirical results are presented at the end of this section 
bearing on the effectiveness of the procedure for special cases. 


Let xX x 


y? %or +88 be a sequence of independent random variables 


(observations on a process) with common distribution function F(x) 
where for all x >0O the condition F(-x) = F(O)[1-F(x) + F(-x)] holds, 
ends let to XY, 


define the cumulative sums S =Z +Z +... +2Z where Z. = Y,/i. 
n 1 2 n 1 a 


be the corresponding signed sequential ranks. We 


Since the condition in Theorem 5.1 is satisfied the a. are independent and 





1-F(0 ie ne 
(22) P(Z = tie 
F(O 1 2 fel 
: os a? te? 2a ere oe 


Some easy computations yield 


20 





(6.2) 


na) BRO od 
var(Z, ) -3 - (2=28(0) 2n(o))"| (+4) + {3 : ovat 0) c i 
R(Z) = 2 ot = 


E(S_) = a2K0) (n + a 1/1) 


Var(S_) = {2 _ Goo ay. (n : y 1), {2 - (2800) =2 RO) : _ i 2 


Although tedious, the distribution P(S = t) can be computed 


exactly. For example 


) | ) | 1-F(0) t=l 
P(S, = t) = P(Z =t) = 
: H F(0) ere il 
(1-F(0))° 3 
7 oan t = 5 ; 2 
(1-F(0)) (0), 
2 2 
P(S, = +t) = P(S, +2 = t) = (1-F(0)) F(O) 6 = 0 
Ae F(0) ae ; 
((0))° 3 
2 a 


and in general 


(6.3) P(S = Gj) = ee = t-Z ) = IS = t-x) P(Z_ =o) 


n-L ak 
where x ranges over -l, - ——, «+-5- = 


aE n-1 
n n 


: pe her soee ger or le 
The procedure we will propose will stop the process whenever SA 


does not lie in some fixed open interval (b, a) where 


Oy 





-~se< b<O<a<m~, In order to determine the operating characteristics 
of such a procedure such as the average number of observations until the 


process is stopped we must compute 
(esb) P(N=n) =P(b<S. <a, i=1, 8, ... 50-175) € (b,a)) 


N being the smallest integral value for which Sy does not lie in the 


60 
open interval (b, a). Then E(N) = ) n P(N =n) gives the average 

number of observations as a function a, b and F(O). In order to 
compute the probability of reaching the boundaries b and a for the 
first time at time n the following procedure may be used. We define 


F, (x) = P(S) Sy, F(x) = P(S, <x, b<S, <a) and in general 


ile 


(6.5) F(x) = P(S_ <x,bd<S8, <a i=1,2,..., no) 


a 
It follows that F,(x) = P(Z, <x-S,, b< 5, <a) -| ne aF, (y) 


and in general 


a 
(6.6) BG) =| om, Gey) ar, 4) 

The probability of reaching boundary a for the first time at n is 
De, 2 F(a) and the probability of reaching boundary b for the 
first time at n is F_(b) Z ee Using these probabilities we 
can also calculate E(N). 

Computations of the probability functions in (6.6) could be 
carried out and the computational burden lessened somewhat by noting 
that for large values of n, the a tend to become identically 
distributed. We now consider some approximations to E(N) using some 


results from sequential analysis. 


D2 





Using (5.1) the characteristic function of Z is given by 


7 Leno) Pr u/n_ -iu(1+1/n) F(0) ae u/n 7 -7iu(1+1/n) 


ee ee 





and using limited expansions of exponentials we have 


ae =10 
(6.8) 9(u) = Lim 9,(u) = (2-F(0)) SSS + Flo) =F 


which is the characteristic function of a random variable with density 


120) ete gee (6 
(6.9) oo 


are) Oi glk 


For large values of n, Z has approximately the density of (6.9). 


The moment generating function associated with (6.9) is 


Z t 
fon) M(t) = F(o) oe + (1-F(0)) i 


which exists for all real values of +t. As an approximation we will use 


1-20) In the cumulative sums 0 = 2. ee tet the 


E(Z ) ~ 1 2 n 


a. are independent and as noted, not identically distributed. However 
if we disregard the first few signed sequential ranks and start later 
in the sequence the approximation to identically distributed random 
variables improves. As before, we take N to be the smallest integral 


value for which S, does not lie in (b, a). We use the results of 


Held >) in the sequel. 


a2 








Consider first the case where F(0) = 1/2 (F is symmetric about 0) 


Here E(Z ) = © and using (3.8) of [5] 


When F(0) #1/2, E(Z) #0 and we can use B(S,) = E(Z.) + E(N) ana 


the approximation E(S,) = aP(S, > a) + b(1-P(S, > a))' “to get 


-3 ab F(O) = 1/2 
eel) E(N) = 
2b + 2(a-b) P(S, 2) 
1-25(0 F(0) # 1/2 
Let h be the non zero root of M(t) =1. A further approximation 
gives 
1. abe 
; = 2s) Vee ee 
Gre ) P(S, 2 a) wah _ 0h 


where of course h depends on the value of F(O). Setting M(t) =1 


1 i say a 


Oo - ae - aa 


we get F(O) = which must be solved for +t. Each 


solution corresponding to a fixed value of F(0O) is a value for h in 


ioele) vteciding, in turn, a solution ctomerll): 


t 

a 4(1-cosh t) + 2t sinh t _ 

ieteeett) = See . Then g'(t) = 4(1-cosh t) ---= t 
e-e -e 4(1-cosh t) 

and considering the numerator a(t) = 4(1-cosh t) + 2t sinh t we find 


a'(t) = sinh t + 2t cosh t and moreover 
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at) = 0 t <0 
a'(O) =0 
a (Ce) 26 iti O 
Thus a(t) >0, making g'(t) >O and g(t) is monotone increasing 
in t. As F(O) increases from O to 1 the solution to 
iat t = au 
F(O) = per ae SAY h(F(O)) increases from -~ to ™» . Notice 
2-e -e 
that 
-t -t 
Lim g(t) = lim S—~*8—-= 2] 
t — © t—o>~ 2e -l-e 
t tc ot 
i 7 
Jk aliael ee) = lim KS 0 
t > -~ t—3-~ 2e -e - 1 
- Ll- a 
Now for h = h(F(O)) increasing, P(S, 2 a) = Tah bh is decreasing 
Pieeiesince Tor 
(oy =< 
Bn) ah bh 
e  -e 
we have 
+ 
ee (ae (2 b)h | leet ; pee} 
ah bh,2 
(e~ - e ) 
(atb)h 


and considering the numerator after factoring out e 


show 


elelo) = soe 


+ be 


we have to 


-ah 


<0 for all h. 


ey) 





eoroing 2 = a/a-b, B = = b/a-b we have @+ 8 = 1, @, B>O and we 


must show that 1<Q ePla-b)h + B o7tla-b)h . Let f(h) =a eP\a-b)h 
-A(a-b)h 

+ Be (a-b) and notice that f(0) =1, f'(0) =O with f"(h) >0o 

since 


Bia-b ih -A(a-b)h 


moh = Clete) e = Cl(s-b me 


f"(h) =a 62 (a-p)° ePia-b)h ‘i oPB(a-b)° 2 la-b)h 


Thus f(h) attains its minimum value at h =0. For increasing values 
of F(O) the corresponding values of h = h(F(O)) increase and 


Ee 2 2) decreases. For F(O) #1/2 we have 


oe ooh 
oe 2(a-v( 5 ea) 
= = ¢ 


(15) E(N) = “== OT CON enn 


In particular taking b=-a, h #0 we have 


: Pilea - sinh(ah))(1-cosh(h)) 
(6.14) B(N) = sinh(ah)(sinh(h) - h 


Eom 90, 2(N) = ne and (6.14) is plotted in Figure 1 for 


de 2G = ae 


ees Pes =e 


selected values of a. g(t) = is shown in Figure 2. 


E(N) is plotted against F(O) in Figure 3. 


Suppose now that a process is observed according to some measurable 


X 


v2 Aor tee» distributed accord- 


characteristic and we have a sequence xX 
ing to F(x) where F(x) satisfies the condition of Theorem 5.1 and 


moreover we assume F(0) = 1/2. If we set boundaries (-a, a) a>O 
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and use the rule which requires us to stop the process when =a ¢ (-a, a) 
for the first time we can expect to continue for Bete observations before 
stopping. However if the process is such that F(0) #1/2 we will stop 
the process in the reduced average time as given in Figure 1. Similar 
computations can be made for arbitrary intevals (b, a) using (6.13). 
However, in the process control problem we wish to detect when a change 
takes place in the distribution of the basic random variables. We have 
seen that when a change takes place from F(x) to G(x) at some point 
in the sequence, the signed sequential ranks are no longer independent 
in general. Suppose the change is to a distribution G(x) such that 
the condition of Theorem 5.1 is still satisfied and the change takes 
place at time m. Intuitively, one might feel that for large values of 
feeene Gistribution of the m + abe signed sequential rank would depend 
very little on F and m. This being so we could assume the sequence 
(4. } to be independent for the purpose of determing the expected number 
of observations until the process is stopped. For example suppose we 
take (b, a) as the continuation interval and denote (6.13) by 

E(a, b, F(O)). Given that S,=% b<x <a the expected number of 


additional observations under G(x) is 
(6.15) E(N|S_ = x) = E(a-x, b-x, G(0)) 


_ 2(b-x) Ase Dp) ee 


= 1-200) “=n one wah | bh 


mhe conditional distribution for Sia is 
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ce 
P(b < sis x) 


P(S_ <xlb<S <a, b<x <a) “P(<S <a) 


(orien, = |e oe coe eas 


and the unconditional total expected number of observations is given 


by 


(oe E(N, m, G(O)) =m + Fay Fe) [ E(N|S_ =) ake (x) 
le) m 


Yo lend some support to the statement that for large values of n, 
thewdisuribution of 2 tn does not depend too much on m and the 


Gissriopution of xX x 


12 Bor see» x (and thus could be taken as G(x) 


to justify (6.15)) we examine its characteristic function as n 7» 


We have 
iuZ 
lim @ (u) = lim B (¢ ) 
noo " no 
n-1 
oD ae a ee 
+ + 
= lim [ e mn (1 -(+) -oCHe |= 
n7a”~ 0 


mM 


i, 


-2(+) + p(t) e ™™ ) ac(t) 
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i: 


ne- 
ae a 
2. olin f e leotu) + q(t) e —_ 
n > © : 


m 


42 
(2-r(t) + p(t) e ™ ) ac(-t) 


n-1 


CO 


7 [ Lim (1-a(+) aaa s = ) aG(t) 


yi 


n-L 
-[ rim (rate) + aleye B®) ac(-t) 
a ac(t) - ean ac(-t) 
O O 
Also, since q(t) = ae - 3A andwee g(t) — == -1 
we have 
Lim 9,(a) = (0) HE— + (1-6(0)) = 2 


11 eae 22.2, 


corresponding to (6.10). 


We now consider a case where we have a change from a distribution 
satisfying the condition in Theorem 5.1 to another such distribution. 


Imagine a production process where some dimension is measured on the 


items being produced. Let these measurements be X_, X assumed 


1’ Be 


to be independent and identically distributed as F(x). Each item is 


subject to inspection and if x. <O the item is removed from the 
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production line with probability p. The result is a new sequence 


say C,, C,, .-. and we call this random censoring and (Cc, ] the 


oe 2 


censored sequence. The distribution of C. can be found by 


(oe) 
P < — = — 
(Ogee Pope = tlc, Xeegea) BS yal cs) 
CO 
; XP aja St1C, = Xi 54) PO 5 = %) 
[9 @) 
= P(X) < t1X, = C,) VP(C, = X45) 
j=l 
= P(X, < tIX, = C,) 
or =f =< 0 
P(C, < t) = P(X, < t 1X, = C,) 
; P(X, < t, X, <0, X, not censored) 
CHES 
P(X, <t, X, not censored) 
a 1-pF(0) 
= (1-p) F(t)/1-pF(0) 
Oras uO 
P(X, < t, X; <0, X, not censored) + P(X, <t, O< x) 


\C 6S) 105) = = 


Sale 0), 
a BO ape 
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For random censoring when X. <0 we have 


ey; F(t) t <0 
(6.9) BCs ta) 
F(t) - pF(0) 
i c t>0 


In a similar way if we censor with probability p when X. > QO we have 


F(t) t <0 
l-p + pF(0) = 
(6.10) P(C, < t) = 
1-p) F(t) + pF(0O) 
l-p + pF(O Bee 


Suppose now that the symmetry condition F(-t) = F(O)[1-F(t) + F(-t)] 
Holdsprorsatinegt > O. In the case Of randompcensoring tor Xs O we 


have for t +0 


G(t) = P(C, < ty 

F(t) = [1-pF(0)] G(t) + pF(0) 
1-pF(0) 

F(-t) = es G(-t) 
1-pF(0) 

F(O) = eae) 


Using these relations it follows that 


G(O)(1-G(t) + G(-t)] = a EO) en F(-t) 
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and from the symmetry condition on F we get 
GO) 1 1=G(t)) 4° G(=e al == er all tO, 


with a change from F(0O) to G(0) = alas . In particular for 


F(O) v2; G(O) = (1=p)/2-p. A similar calculation for censoring when 


- > 0O shows that the symmetry condition holds for G(t) and 


G(O) = F(O)/1l-p + pF(O). For F(0O) = 1/2, G(O) = 1/2 + p. 
We shall now compare the expected number of observations needed to 
stop a process subject to random sampling using a Shewhart type control 


chart with the expected number needed using the procedure described 


X 


above. Consider a sequence of independent observations AD» 2 


with common continuous symmetric distribution F(x). Subjecting the 


X, to random censoring when X, <O we get from (6.9) the distribution 


of the censored observations Ch» Cy» as 
1-p 
5p 2F(t) eo 
Keck) P(C, Sb) = 
2F(t) - p an 
e-p 


We assume here that when p =O the process is in control and that 
when the process starts some fixed value of p, O<p<l is in effect. 
If p>0O we want to stop the process as quickly as possible. We con- 


Sider three procedures: 
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procedure 1 - when C. >b->O for the first time, stop the process 
procedure 2 - when Ic, | ~ b> 0 for-tneativst time, stop the process 


procedure 3 - when Is | - a >O for the first time, stop the process 


Procedures 1 and 2 are Shewhart type procedures and b is usually 

taken so that the probability of stopping at a particular stage is 

small when p =O. Procedure 3 is the signed sequential rank procedure 

previously described in this section. Define Py = er >b) and 

ay = P( Ic, | >b) assuming p =0. For each procedure the probability 
th 


of falling outside the control limit for the first time at the n 


observation is 
p l-p joel 2 


and B, (N) = 1/p,; E, (NN) = 1/P, are the expected number of observations 


taken before stopping. B3(N) = See and setting E, (N) = E, (N) = B3(N) 
we get 
2 
ee IS ris) = ly Fe 
By = 1-F(b) - Feb) = 28a iL) ee 
ory p= 0 Py = Bee > 1b) = 1-P(C, < by = 2F(-b) = od and 
2 (2-p) 3a 
oe Peaue)) 4 Lee i ae 
eee ye (CO) = —b)i a eee F(-b) = 2F(-b) = 1/3a 
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Thus for p > 0 


4 
am 


E, (N) 


Ep(N) = 1/p5 = 3a 


ah 1 
9g 4 

Cah 

e - 1 


PsN) = “aye 


and notice that since Ue es O)r= = < 1/2, it. follows) that) he or 


BE, (N) and E,(N) increase quadratically with a and E, is essentially 


z 


linear in a. For example 


900 1050 
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and procedure 2 is insensitive to values of p->OQO. The values of h 
corresponding to p =1, 3/4, 1/2, 1/4 are -» , -2.2, -.9, -.5 
respectively. 

The following tabulated results were obtained empirically to 
determine the effect of translation of the mean of the observations. 
We considered normal observations with mean wp and variance 1 and 


stopped sampling when Is,] >a for the first time where 
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and Y, is the signed sequential rank of ee M(p,1). For each 
parameter pair (a,p) twenty trials were performed except for pw = .1, 
Cc, .-3 where fifty trials were used. Sample averages, sample variances 


and sample standard deviations for termination time N are given. 








seer a = 20 
n N s s N s s 
ml 180.78 13449 .27 115397 364.56 31279 43 feces 
22 iol. 70 4306 .46 57.50 179.04 Sor seiko Bi S85, 
5 69.95 O.. 12 26.64 LO Ae a alo Sf sou 
4 52.55 324.99 18 .02 108.65 1160 .87 34.07 
a5 42.25 139.14 11.79 ees bloraee 19.16 
ae) 39.60 Lea! Oni ie (orl 72.70 171.69 Deo 
a Denes 128.05 e.g ey pels: 130.26 iaee an 
26 28.70 31.48 pie 0 62.05 D5 .om Oa 
9 28.00 Boao Cals SRO C75) 8.20 
Bre) 28.80 29.64 pee 52.25 59 6.30 
ie 25.40 7.95 Zeol 4h £00 26.94 so ike, 
20 22.40 4.98 2.23 42 45 12.05 ely 
2.5 20.90 5.25 2.29 41.55 9.20 3.03 
B20 265 7 cate. 215 40.95 lgRer' Sere 
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(. Summary and Conclusions. We remarked in the introduction on 
the paucity of nonparametric sequential procedures, particularly those 
based on ranks of observations. The author feels that the absence of 
a natural way of assigning ranks to observations, as the observations 
are taken, without reranking, was a significant cause for the lack of 
such procedures. The sequential ranking schemes defined and studied in 
this dissertation provide us with methods whereby ranks may be assigned 
in just such a manner. 

In order to use the methods of sequential parametric hypothesis 
testing (Wald's sequential probability ratio test) in our nonparametric 


X 


setting, we must replace the sequence of observations X >? 


ee 


R 


Elo: and base the test on the provabalicy, 


by a sequence of ranks Ry, 
ratio of the ranks. This can be done by the sequential ranking scheme 
defined in Section 43. One basic nonparametric problem is the two- 
sample problem where we must decide whether or not an X- population 
and a Y- population have the same probability distribution. This 
problem was treated in Section 4 in the special case where the alter- 
natives are of the form proposed by Lehmann [1]. However the method 
proposed in Section 4 is general in the sense that in order to carry 
out the test one must only be able to compute P(U, < U, yeaa Uy) 
where the U's are X's and Y's. In general this computation is 
dificult, but for special alternatives waere™ the vcompucation is tea= 
sible, the method in Section 4 applies directly. 

Notice that in the finite sample size problem nothing is sacrificed 


by ranking sequentially (Theorem 4.1) instead of using ordinary ranks. 


In fact a little is gained inasmuch as the sequential ranks may be 
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viewed as a transformation of the dependent ordinary ranks into the 
independent sequential ranks. 

Merely ranking observations tells us nothing of their location, 
except relative to each other. In order to take into account the 
location of each observation relative to the origin as well as its 
size (absolute value) and relative location, the method of signed 
sequential ranking was devised. Contrary to sequential ranks, signed 
sequential ranks obtained from independent identically distributed 
observations are not independent in general. A sufficient condition 
on the distribution of the observations is given in Theorem 5.1 to 
insure that the signed sequential ranks will be independent. In the 
process control problem we used signed sequential ranks of observations 
whose distributions satisfied this condition. This simplified the 
calculations since sums of independent random variables were involved 
in the analysis. 

The methods of sequential ranking and signed sequential ranking 
proposed in this dissertation are new, as far as the author can deter- 
mine, and provide a natural way of assigning ranks to observations 
which fits into the theory of sequential analysis (hypothesis testing) 
and sequential procedures (process control). All the attendant distri- 
bution theory results are new and the condition of Theorem 5.1 which 
insures the independence of signed sequential ranks is the only one 
known to the author. 

There are many areas for further investigation suggested by this 
research. In the sequential probability ratio test of Section 4 we 
did not use the sequential ranks explicitly (except for Z in equation 


(4.2)) in the definition of the probability ratio Sy: Sy can be 


(0 





written in terms of (Z, , Z Z, 


59 eS tfiessequential ranks, jb ene 


by, 
expression is quite complicated and it is much more convenient to use 
(4.1) and (4.2) which incorporate the most recent sequential rank only. 
Thus the behavior of Oy was obtained by reference to A; Ay» one Ay. 
More general results are needed as to the probability of termination of 
BHO (Zn) for alternatives other than Lehmann alternatives. This 
is necessary because under the alternative hypothesis the sequential 
ranks are not independent generally and the conservative approximations 
A = ieee) leete memaine valid stor ysuccessive dependent observarioncenwnen che 
probability is one that the procedure will ultimately terminate. 

A second area for further study is the evaluation of the rule 
given in Section 6 for process control problems when changes from F 
to G are not of the form presented (e.g. G(x) = F(x pa A) A>O). 


Also there are other ad hoc rules which could be proposed using signed 


sequential ranks (or sequential ranks) in process control problems. 
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